Abstract

Let $F$ be a graph which contains an edge whose deletion reduces its chromatic
number. We prove tight bounds on the number of copies of $F$ in a graph with a
prescribed number of vertices and edges. Our results extend those of Simonovits [10],
who proved that there is at least one copy of $F$, and of Rademacher, Erdős [1, 2] and
Lovász-Simonovits [4], who proved similar counting results when $F$ is a complete graph.

One of the simplest cases of our theorem is the following new result. There is an
absolute positive constant $c$ such that if $n$ is sufficiently large and $1 \le q < cn$, then
every $n$ vertex graph with $\lfloor n^2/4 \rfloor + q$ edges contains at least
\[
q \left\lfloor \frac{n}{2} \right\rfloor \left( \left\lfloor \frac{n}{2} \right\rfloor - 1 \right) \left( \left\lceil \frac{n}{2} \right\rceil - 2 \right)
\]
copies of a five cycle. Similar statements hold for any odd cycle and the bounds are
best possible.



1 Introduction 



Mantel [5] proved that a graph with $n$ vertices and $\lfloor n^2/4 \rfloor + 1$ edges contains a triangle.
Rademacher extended this by showing that there are at least $\lfloor n/2 \rfloor$ copies of a triangle.
Subsequently, Erdős [1, 2] proved that if $q < cn$ for some small constant $c$, then $\lfloor n^2/4 \rfloor + q$
edges guarantee at least $q \lfloor n/2 \rfloor$ triangles. Later Lovász and Simonovits [4] proved that the
same statement holds with $c = 1/2$, thus confirming an old conjecture of Erdős. They also
proved similar results for complete graphs.

In this paper we extend the results of Erdős and Lovász-Simonovits by proving such statements
for the broader class of color critical graphs, which are graphs that contain an edge
whose removal reduces their chromatic number^1. In many ways our proof is independent of
the specific structure of $F$.

The main new tool we use is the graph removal lemma, which is a consequence of the
hypergraph regularity lemma (see Gowers [3], Nagle-Rödl-Schacht [8], Rödl-Skokan [9], Tao
[11]). In subsequent papers [6, 7] we will extend these results to hypergraphs. The novelty in
this project is the use of the removal lemma to count substructures in (hyper)graphs rather
precisely.

We often associate a graph with its edge set. Given graphs $F, H$, where $F$ has $f$ vertices, a
copy of $F$ in $H$ is a subset of $f$ vertices and $|F|$ edges of $H$ such that the subgraph formed
by this set of vertices and edges is isomorphic to $F$. In other words, if we let $\mathrm{Aut}(F)$
denote the number of automorphisms of $F$, then the number of copies of $F$ in $H$ is the number
of edge-preserving injections from $V(F)$ to $V(H)$ divided by $\mathrm{Aut}(F)$.
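
The counting convention above can be illustrated with a minimal brute-force sketch in Python. It is not part of the paper: the function name `count_copies` and the vertex-list and edge-list representation are assumptions made for illustration, and the enumeration is only feasible for very small graphs.

```python
from itertools import permutations


def count_copies(f_vertices, f_edges, h_vertices, h_edges):
    """Copies of F in H = (edge-preserving injections V(F) -> V(H)) / Aut(F)."""

    def injections(target_vertices, target_edges):
        target = {frozenset(e) for e in target_edges}
        total = 0
        # Every ordered choice of |V(F)| distinct target vertices is an injection.
        for image in permutations(target_vertices, len(f_vertices)):
            phi = dict(zip(f_vertices, image))
            if all(frozenset((phi[u], phi[v])) in target for u, v in f_edges):
                total += 1
        return total

    # Edge-preserving injections of F into itself are exactly its automorphisms.
    aut = injections(f_vertices, f_edges)
    return injections(h_vertices, h_edges) // aut


# Hypothetical example graphs: a triangle F and the complete graph K_4 as H.
triangle = ([0, 1, 2], [(0, 1), (1, 2), (0, 2)])
k4 = (list(range(4)), [(i, j) for i in range(4) for j in range(i + 1, 4)])
print(count_copies(*triangle, *k4))  # 24 injections / 6 automorphisms = 4
```

Dividing by the number of automorphisms is what turns labelled embeddings into copies in the sense used here, namely a vertex subset together with an edge subset.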

Theorem 1. (Graph Removal Lemma [3, 8, 9, 11]) Let $F$ be a graph with $f$ vertices.
Suppose that an $n$ vertex graph $H$ has at most $o(n^f)$ copies of $F$. Then there is a set of edges
in $H$ of size $o(n^2)$ whose removal from $H$ results in a graph with no copies of $F$.

Say that a graph $F$ is $r$-critical if it has chromatic number $r + 1$ and it contains an edge
whose deletion reduces the chromatic number to $r$. As usual, we define the Turán number
$\mathrm{ex}(n, F)$ to be the maximum number of edges in an $n$ vertex graph that contains no copy of
$F$ as a (not necessarily induced) subgraph. The Turán graph $T_r(n)$ is the $n$ vertex $r$-partite
graph with the maximum number of edges; its parts all have size $\lfloor n/r \rfloor$ or $\lceil n/r \rceil$. Let
\[
t_r(n) = |T_r(n)| = \sum_{1 \le i < j \le r} \left\lfloor \frac{n+i-1}{r} \right\rfloor \left\lfloor \frac{n+j-1}{r} \right\rfloor .
\]

Since $\chi(T_r(n)) = r$ it does not contain any $r$-critical graph $F$, and consequently $\mathrm{ex}(n, F) \ge
t_r(n)$. Simonovits [10] proved that if $n$ is sufficiently large, then we have equality. In other
words, every $n$ vertex graph ($n > n_0$) with $t_r(n) + 1$ edges contains at least one copy of $F$.
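
As a small numerical illustration of the Turán number, the following Python sketch computes $t_r(n)$ from the part sizes $\lfloor (n+i-1)/r \rfloor$, $i = 1, \ldots, r$. The function names are hypothetical and chosen only for this example.

```python
def turan_part_sizes(n, r):
    """Part sizes floor((n + i - 1) / r) for i = 1, ..., r."""
    return [(n + i - 1) // r for i in range(1, r + 1)]


def turan_number(n, r):
    """t_r(n): sum of products of part sizes over all pairs of parts."""
    sizes = turan_part_sizes(n, r)
    return sum(sizes[i] * sizes[j] for i in range(r) for j in range(i + 1, r))


print(turan_part_sizes(7, 3))  # [2, 2, 3]
print(turan_number(7, 3))      # 2*2 + 2*3 + 2*3 = 16
print(turan_number(10, 2))     # floor(100 / 4) = 25
```

For $r = 2$ this recovers the bound $\lfloor n^2/4 \rfloor$ from Mantel's theorem.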



^1 Note that a color critical graph is often defined as one all of whose proper subgraphs have lower
chromatic number, but our definition is slightly different.

We extend his result by proving that there are many copies and determining this optimal
number. In fact, the number of copies is the number one gets by adding an edge to $T_r(n)$.

Definition 2. Fix $r \ge 2$ and let $F$ be an $r$-critical graph. Then $c(n, F)$ is the minimum
number of copies of $F$ in the graph obtained from $T_r(n)$ by adding one edge.

Note that for any fixed $F$, computing $c(n, F)$ is just a finite process (see Lemma 5 in the
next section). Our result below therefore gives an explicit formula for each color critical $F$,
even though this formula may be very complicated.

Theorem 3. Fix $r \ge 2$ and an $r$-critical graph $F$. There exists $\delta = \delta_F > 0$ such that if $n$ is
sufficiently large and $1 \le q < \delta n$, then every $n$ vertex graph with $t_r(n) + q$ edges contains at
least $q c(n, F)$ copies of $F$.

Theorem 3 is asymptotically sharp, in that for every $q < \delta n$, there exist graphs with $t_r(n) + q$
edges and at most $(1 + O(1/n)) q c(n, F)$ copies of $F$. To see this, simply add a matching of
size $q$ to the appropriate part of $T_r(n)$. Each new edge lies in precisely $c(n, F)$ copies of $F$
that contain only one new edge, giving a total of $q c(n, F)$ copies. If $F$ has $f$ vertices, then
the number of copies of $F$ that contain at least two new edges is $O(q^2 n^{f-4})$. It is
easy to see that $c(n, F) = \Theta(n^{f-2})$ (see Lemma 5 in the next section for more details), and
since $q < n$, we obtain $q^2 n^{f-4} = O(1/n) q c(n, F)$ as desired.

In many instances Theorem 3 is sharp. Let us examine two special cases.

Odd cycles. Fix $k \ge 1$ and let $F = C_{2k+1}$. Then we quickly see that
\[
c(n, F) = \lfloor n/2 \rfloor (\lfloor n/2 \rfloor - 1) \cdots (\lfloor n/2 \rfloor - k + 1)(\lceil n/2 \rceil - 2) \cdots (\lceil n/2 \rceil - k),
\]
where we interpret the second product as empty if $k = 1$. Moreover, if we add a matching
within one of the parts of $T_2(n)$, then it is easy to see that no copy of $F$ contains two edges
of the matching. Hence Theorem 3 is sharp in the case $F = C_{2k+1}$. Even the simple case of
counting $C_5$'s was not previously known.
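
The odd-cycle formula can be sanity-checked on small cases with the following Python sketch, which is an illustration and not part of the paper. It assumes the extra edge $xy$ is placed in the larger part of $T_2(n)$, and it uses the fact that every copy of $C_{2k+1}$ must contain $xy$, so copies correspond to simple paths with $2k$ edges from $x$ to $y$. All function names are hypothetical.

```python
def odd_cycle_closed_form(n, k):
    """Closed form for c(n, C_{2k+1}) with the new edge in the larger part."""
    small, large = n // 2, n - n // 2
    result = 1
    for j in range(k):          # small (small - 1) ... (small - k + 1)
        result *= small - j
    for j in range(2, k + 1):   # (large - 2) ... (large - k), empty if k = 1
        result *= large - j
    return result


def odd_cycles_brute_force(n, k):
    """Count (2k+1)-cycles through xy in T_2(n) plus the edge xy (needs n >= 3)."""
    large = n - n // 2
    part = [0 if v < large else 1 for v in range(n)]  # vertices 0..large-1: larger part
    x, y = 0, 1                                       # both lie in the larger part

    def adjacent(a, b):
        return part[a] != part[b] or {a, b} == {x, y}

    def paths(current, visited, remaining):
        # Number of simple paths with `remaining` more edges that end at y.
        if remaining == 0:
            return 1 if current == y else 0
        total = 0
        for w in range(n):
            if w not in visited and adjacent(current, w):
                total += paths(w, visited | {w}, remaining - 1)
        return total

    return paths(x, {x}, 2 * k)


print(odd_cycle_closed_form(7, 2), odd_cycles_brute_force(7, 2))  # 12 12
```

For $k = 1$ both functions return $\lfloor n/2 \rfloor$, the number of triangles created by a single added edge.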

$K_4$ minus an edge. Let $F$ be the graph obtained from $K_4$ by deleting an edge. Then it is
easy to see that $c(n, F) = \binom{\lfloor n/2 \rfloor}{2}$ and again this is sharp by adding a matching to one part.

For any fixed $\varepsilon > 0$ and $1 \le q < n^{1-\varepsilon}$, the proof of Theorem 3 actually produces a vertex
that lies in $q c(n, F)$ copies of $F$ or $(1 - \varepsilon) q$ edges that each lie in $(1 - \varepsilon) c(n, F)$ copies of $F$.
Such information about the distribution of the copies of $F$ does not seem to follow from the
methods of [1, 2, 4], even for cliques.

Throughout the paper, Roman letters (e.g. $r, s, n$) denote integers and Greek letters
(e.g. $\alpha_F, \gamma_F, \varepsilon, \delta$) denote reals. Given a set of pairs $H$, let $d_H(v)$ be the number of pairs
in $H$ containing $v$. So if we view $H$ as a graph, then $d_H(v)$ is just the degree of vertex $v$.



2 Three lemmas 

In this section we will prove three technical lemmas needed in the proof of Theorem 3.

Lemma 4. Suppose that $r \ge 2$ is fixed, $n$ is sufficiently large, $s < n$ and $n_1 + \cdots + n_r = n$. If
\[
\sum_{1 \le i < j \le r} n_i n_j \ge t_r(n) - s,
\]
then $\lfloor n/r \rfloor - s \le n_i \le \lceil n/r \rceil + s$ for all $i$.

Proof. The result is certainly true for $s = 0$ by definition of $t_r(n)$ and the easy fact that
$\sum_{1 \le i < j \le r} n_i n_j$ is maximized when $|n_i - n_j| \le 1$ for all $i, j$. So assume that $s \ge 1$ and let us
proceed by induction on $s$. It is more convenient to prove the contrapositive, so assume that
for some $i$ either $n_i > \lceil n/r \rceil + s$ or $n_i < \lfloor n/r \rfloor - s$ and we wish to prove that
$\sum_{1 \le i < j \le r} n_i n_j < t_r(n) - s$. We may assume that $n_1 \ge \cdots \ge n_r$ and that $n_1 > \lceil n/r \rceil + s$
(the case $n_r < \lfloor n/r \rfloor - s$ is symmetrical and has an almost identical proof). Define
$n'_1 = n_1 - 1$, $n'_r = n_r + 1$ and $n'_i = n_i$ for $1 < i < r$. Then $\sum_i n'_i = n$ and we certainly
have $n'_i > \lceil n/r \rceil + (s - 1)$ for some $i$. By the induction hypothesis,
\[
\sum_{1 \le i < j \le r} n'_i n'_j < t_r(n) - (s - 1).
\]



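We also have $n_1 \ge \lceil n/r \rceil + s + 1$ and $n_r \le \lfloor n/r \rfloor$. Consequently,
\begin{align*}
\sum_{1 \le i < j \le r} n_i n_j &= \sum_{1 \le i < j \le r} n'_i n'_j - n_1 + n_r + 1 \\
&\le \sum_{1 \le i < j \le r} n'_i n'_j - (\lceil n/r \rceil + s + 1) + \lfloor n/r \rfloor + 1 \\
&\le \sum_{1 \le i < j \le r} n'_i n'_j - s \\
&< t_r(n) - (s - 1) - s \\
&\le t_r(n) - s.
\end{align*}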



This completes the proof of the lemma. 



□ 
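
The statement of Lemma 4 can be checked exhaustively for small parameters. The Python sketch below is an illustration only: the helper names are hypothetical, part sizes are allowed to be zero, and the check covers a small range of $n$ rather than the "sufficiently large" regime of the lemma.

```python
def turan_number(n, r):
    """t_r(n) computed from the balanced part sizes floor((n + i - 1) / r)."""
    sizes = [(n + i - 1) // r for i in range(1, r + 1)]
    return sum(sizes[i] * sizes[j] for i in range(r) for j in range(i + 1, r))


def compositions(n, r):
    """All tuples of r nonnegative integers summing to n."""
    if r == 1:
        yield (n,)
        return
    for first in range(n + 1):
        for rest in compositions(n - first, r - 1):
            yield (first,) + rest


def lemma4_holds(n, r, s):
    """Check the conclusion for every part-size vector meeting the hypothesis."""
    low, high = n // r - s, -(-n // r) + s  # floor(n/r) - s and ceil(n/r) + s
    threshold = turan_number(n, r) - s
    for sizes in compositions(n, r):
        pairs = sum(sizes[i] * sizes[j] for i in range(r) for j in range(i + 1, r))
        if pairs >= threshold and not all(low <= m <= high for m in sizes):
            return False
    return True


print(all(lemma4_holds(n, r, s)
          for r in (2, 3) for n in range(r, 16) for s in range(n)))
```

The inductive step in the proof only uses the identity $\sum n_i n_j = \sum n'_i n'_j - (n_1 - n_r - 1)$, which is why the check succeeds already for small $n$.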

Lemma 5. Fix $r \ge 2$ and an $r$-critical graph $F$ with $f$ vertices. There are positive constants
$\alpha_F, \beta_F$ such that if $n$ is sufficiently large, then
\[
|c(n, F) - \alpha_F n^{f-2}| < \beta_F n^{f-3}.
\]
In particular, $(\alpha_F/2) n^{f-2} < c(n, F) < 2\alpha_F n^{f-2}$.

Proof. We may assume that $r \mid n$. Indeed, suppose we could prove the result in the case $r \mid n$
and we are given $r$ that does not divide $n$. The graph $T_r(n)$ has parts of size $\lfloor n/r \rfloor$ or $\lceil n/r \rceil$,
so we can add one vertex to each of at most $r - 1$ parts so that all parts have size $\lceil n/r \rceil$. Also add
edges from the new vertices to all parts distinct from the one that they lie in. The resulting
graph is $T_r(n')$ with $n < n' < n + r$ vertices and $r \mid n'$. So we have
\[
c(n, F) \le c(n', F) \le c(n + r, F) < \alpha_F (n + r)^{f-2} + \beta_F (n + r)^{f-3} < \alpha_F n^{f-2} + (f r \alpha_F + 2\beta_F) n^{f-3},
\]
where the last inequality follows since $n$ is large. The lemma therefore holds with $\alpha_F$ and
$\beta'_F = f r \alpha_F + 2\beta_F$. A similar argument gives the required lower bound on $c(n, F)$.

Let us write an explicit formula for $c(n, F)$. Let $H$ be obtained from $T_r(n)$ by adding one
edge $xy$ in the first part. Say that an edge $uv \in F$ is good if $\chi(F - uv) = r$. Let $\chi_{uv}$ be
a proper $r$-coloring of $F - uv$ such that $\chi_{uv}(u) = \chi_{uv}(v) = 1$. Every proper $r$-coloring of
$F - uv$ gives the same color to $u, v$, since $\chi(F) > r$. Let $\chi^i_{uv}$ be the number of vertices of
$F$ excluding $u, v$ that receive color $i$. An edge-preserving injection of $F$ to $H$ is obtained
by choosing a good edge $uv$ of $F$, mapping it to $xy$, then mapping the remaining vertices
of $F$ to $H$ such that no two adjacent vertices get mapped to the same part of $H$. Such a
mapping is given by a coloring $\chi_{uv}$, and the number of mappings associated with $\chi_{uv}$ is just
the number of ways the vertices colored $i$ can get mapped to the $i$th part of $H$. The vertices
mapped to the first part cannot get mapped to $x, y$ since these have been taken by $u, v$, and
there are two ways to map $\{u, v\}$ to $\{x, y\}$. Altogether we obtain
\[
c(n, F) = \frac{1}{\mathrm{Aut}(F)} \sum_{uv \text{ good}} \sum_{\chi_{uv}} 2 \left( \frac{n}{r} - 2 \right)_{\chi^1_{uv}} \prod_{i=2}^{r} \left( \frac{n}{r} \right)_{\chi^i_{uv}},
\]
where $(m)_k = m(m-1) \cdots (m-k+1)$ denotes the falling factorial.

Expanding this expression, we get a sum of polynomials in $n$ of degree $\chi^1_{uv} + \sum_{i=2}^{r} \chi^i_{uv} = f - 2$
with positive leading coefficients. Consequently, $c(n, F)$ is a polynomial in $n$ of degree $f - 2$
with positive leading coefficient. Let $\alpha_F$ be this coefficient. Since $n$ is sufficiently large, and
the coefficients of $c(n, F)$ are all fixed independent of $n$, we can bound the absolute value of
the contribution of all the other terms by $\beta_F n^{f-3}$ for some positive $\beta_F$ that depends only on
$F$. This completes the proof. □
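
The explicit formula from the proof of Lemma 5 can be evaluated mechanically for small graphs. The following Python sketch is an illustration under stated assumptions: it requires $r \mid n$, it enumerates colorings by brute force, and the names `falling`, `automorphism_count` and `c_formula` are hypothetical. Edges that are not good contribute nothing, because $F - uv$ then has no proper $r$-coloring at all.

```python
from itertools import permutations, product


def falling(m, k):
    """Falling factorial (m)_k = m (m - 1) ... (m - k + 1)."""
    result = 1
    for j in range(k):
        result *= m - j
    return result


def automorphism_count(vertices, edges):
    """Number of vertex permutations that map every edge of F to an edge of F."""
    edge_set = {frozenset(e) for e in edges}
    count = 0
    for image in permutations(vertices):
        phi = dict(zip(vertices, image))
        if all(frozenset((phi[a], phi[b])) in edge_set for a, b in edges):
            count += 1
    return count


def c_formula(vertices, edges, r, n):
    """Evaluate the formula for c(n, F), assuming r divides n."""
    assert n % r == 0
    part = n // r
    total = 0
    for u, v in edges:
        rest_edges = [e for e in edges if set(e) != {u, v}]
        others = [w for w in vertices if w not in (u, v)]
        for colors in product(range(1, r + 1), repeat=len(others)):
            chi = dict(zip(others, colors))
            chi[u] = chi[v] = 1
            if any(chi[a] == chi[b] for a, b in rest_edges):
                continue  # not a proper coloring of F - uv
            class_sizes = [colors.count(i) for i in range(1, r + 1)]
            term = 2 * falling(part - 2, class_sizes[0])
            for size in class_sizes[1:]:
                term *= falling(part, size)
            total += term
    return total // automorphism_count(vertices, edges)


c5 = (list(range(5)), [(i, (i + 1) % 5) for i in range(5)])
print(c_formula(*c5, r=2, n=10))  # 5 * 4 * 3 = 60
```

For the five cycle with $n = 10$ the result agrees with $\lfloor n/2 \rfloor (\lfloor n/2 \rfloor - 1)(\lceil n/2 \rceil - 2) = 60$.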

Given integers $n_1, \ldots, n_r$, let $c(n_1, \ldots, n_r, F)$ be the number of copies of $F$ in the graph
obtained from the complete $r$-partite graph with parts of sizes $n_1, \ldots, n_r$ by adding an edge to the
part of size $n_1$.

Lemma 6. Let $F$ be an $r$-critical graph. There is a positive constant $\gamma_F$ depending only on
$F$ such that the following holds for $n$ sufficiently large. If $n_1 + \cdots + n_r = n$ with
$\lfloor n/r \rfloor - s \le n_i \le \lceil n/r \rceil + s$ and $s < n/3r$, then
\[
c(n_1, \ldots, n_r, F) \ge c(n, F) - \gamma_F s n^{f-3}.
\]

Proof. If $s = 0$, then the result holds by the definition of $c(n, F)$, so assume that $s \ge 1$. Let
$H$ be the graph obtained from the complete $r$-partite graph with parts $n_1 \ge \cdots \ge n_r$ by
adding an edge $xy$ to the part of size $n_1$. Since $|n_i - n_j| \le 2s + 1$, we can remove at most
$2s + 1 \le 3s$ vertices (excluding $x, y$) from each part of $H$ so that each part has size $n_r$. The
resulting graph $H'$ satisfies $H' - xy = T_r(n')$ with $n' \ge n - 3rs$. Since $n$ is sufficiently large
and $s < n/3r$, we have
\[
(n - 3rs)^{f-2} > n^{f-2} - 2^f r s n^{f-3}.
\]
Put $\gamma_F = \alpha_F 2^f r + 2\beta_F$. Lemma 5 now gives
\begin{align*}
c(n_1, \ldots, n_r, F) &\ge c(n - 3rs, F) \\
&\ge \alpha_F (n - 3rs)^{f-2} - \beta_F (n - 3rs)^{f-3} \\
&\ge \alpha_F n^{f-2} - \alpha_F 2^f r s n^{f-3} - \beta_F n^{f-3} \\
&\ge \alpha_F n^{f-2} + \beta_F n^{f-3} - (\alpha_F 2^f r + 2\beta_F) s n^{f-3} \\
&= \alpha_F n^{f-2} + \beta_F n^{f-3} - \gamma_F s n^{f-3} \\
&\ge c(n, F) - \gamma_F s n^{f-3}
\end{align*}
and the proof is complete. □

3 Proof of Theorem 3

In this section we will prove Theorem 3. We need the following stability result proved by
Erdős and Simonovits [10].



Theorem 7. (Erdős-Simonovits Stability Theorem [10]) Let $r \ge 2$ and $F$ be a fixed
$r$-critical graph. Let $H$ be a graph with $n$ vertices and $t_r(n) - o(n^2)$ edges that contains no
copy of $F$. Then there is a partition of the vertex set of $H$ into $r$ parts so that the number
of edges contained within a part is at most $o(n^2)$. In other words, $H$ can be obtained from
$T_r(n)$ by adding and deleting a set of $o(n^2)$ edges.

Remark. The $o(1)$ notation above should be interpreted in the obvious way, namely
$\forall \eta$, $\exists \zeta, n_0$ such that if $n > n_0$ and $|H| > t_r(n) - \zeta n^2$, then $H = T_r(n) \pm \eta n^2$ edges. We
will not explicitly mention the role of $\zeta, \eta$ when we use the result, but it should be obvious
from the context. We will also assume that $\zeta$ is much smaller than $\eta$, since if the result holds
for $\zeta$, then it also holds for $\zeta' < \zeta$. Similar comments apply for applications of Theorem 1.

Proof of Theorem 3. Our constants $\delta_i, \varepsilon_i, \varepsilon$ will enjoy the following hierarchy:
\[
1/n \lll \delta_1 \lll \delta_2 \ll \delta_3 \ll \delta_4 \ll \varepsilon_4 \ll \varepsilon_3 \ll \varepsilon_2 \ll \varepsilon_1 \ll \varepsilon \ll 1.
\]
The notation $\zeta \lll \eta$ above means that we are applying some theorem with input $\eta$ and
output $\zeta$ (the quantification is $\forall \eta, \exists \zeta$) and $\zeta \ll \eta$ simply means that $\zeta$ is a sufficiently small
function of $\eta$ that is needed to satisfy some inequality in the proof. In addition, we require
\[
\varepsilon < \frac{\alpha_F}{4(\gamma_F + 2^{f^2})},
\]
where $\alpha_F$ comes from Lemma 5 and $\gamma_F$ comes from Lemma 6. Also, $n$ is sufficiently large
that Lemmas 4, 5, 6 apply whenever needed. We emphasize that $\varepsilon_4$ is an absolute constant
that depends only on $F$. Set $\delta = \varepsilon_4/4\alpha_F$ and suppose that $1 \le q < \delta n$. Let $H$ be an $n$
vertex graph with $t_r(n) + q$ edges. Write $\#F$ for the number of copies of $F$ in $H$.

If $\#F \ge n^{f-1/2}$, then since $c(n, F) < 2\alpha_F n^{f-2}$ and $q < (\varepsilon_4/4\alpha_F) n$, we have
\[
\#F \ge n^{f-1/2} > (\varepsilon_4/4\alpha_F) n (2\alpha_F n^{f-2}) > q c(n, F)
\]
and we are done. So assume that $\#F < n^{f-1/2} = (1/n^{1/2}) n^f$. Since $n$ is sufficiently large,
by the Removal lemma there is a set of at most $\delta_1 n^2$ edges of $H$ whose removal results in a
graph $H'$ with no copies of $F$. Since $|H'| > t_r(n) - \delta_1 n^2$, by Theorem 7 we conclude that
there is an $r$-partition of $H'$ (and also of $H$) such that the number of edges contained entirely
within a part is at most $\delta_2 n^2$. Now pick an $r$-partition $V_1 \cup \cdots \cup V_r$ of $H$ that maximizes
$e(V_1, \ldots, V_r)$, the number of edges that intersect two parts. We know that
$e(V_1, \ldots, V_r) \ge t_r(n) - \delta_2 n^2$, and an easy calculation also shows that each $V_i$ has size
$n_i = n/r \pm \delta_3 n$.

Let $B$ (bad) be the set of edges of $H$ that lie entirely within a part and let $G$ (good) be the
set of edges of $H$ that intersect two parts, so $G = H - B$. Let $M$ (missing) be the set of
pairs which intersect two parts that are not edges of $H$. Then $G \cup M$ is $r$-partite so it has
at most $t_r(n)$ pairs and
\[
t_r(n) + q - |B| + |M| = |(H - B) \cup M| = |G \cup M| = |G| + |M| \le t_r(n).
\]
Consequently,
\[
q + |M| \le |B| \le \delta_2 n^2.
\]
Also, $|H| = |G| + |B|$ so we may suppose that $|G| = t_r(n) - s$ and $|B| = q + s$ for some
$s \ge 0$. For an edge $e \in B$, let $F(e)$ be the number of copies of $F$ in $H$ containing $e$ as their
unique edge from $B$.

If $s = 0$, then $G = T_r(n)$ and $F(e) \ge c(n, F)$ for every $e \in B$ (by definition of $c(n, F)$) so we
immediately obtain $\#F \ge |B| c(n, F) = q c(n, F)$.

We may therefore assume that $s \ge 1$. Partition $B = B_1 \cup B_2$, where
\[
B_1 = \{e \in B : F(e) \ge (1 - \varepsilon) c(n, F)\}.
\]
A potential copy of $F$ is a copy of $F$ in $G \cup M \cup B = H \cup M$ that uses exactly one edge of
$B$.






Claim 1. $|B_1| \ge (1 - \varepsilon)|B|$.

Proof of Claim. Suppose to the contrary that $|B_2| > \varepsilon |B|$. Pick $e \in B_2$. Assume wlog
that $e \subset V_1$. By Lemma 6 (observing that $n_i = n/r \pm \delta_3 n$ and $\delta_3 n < n/3r$), the number of
potential copies of $F$ containing $e$ is
\[
c(n_1, \ldots, n_r, F) \ge c(n, F) - \gamma_F (\delta_3 n) n^{f-3} \ge c(n, F) - \gamma_F \delta_3 n^{f-2} \ge (1 - \delta_4) c(n, F).
\]
At least $(\varepsilon/2) c(n, F)$ of these potential copies of $F$ have a pair from $M$, for otherwise
\[
F(e) \ge c(n_1, \ldots, n_r, F) - (\varepsilon/2) c(n, F) \ge (1 - \delta_4 - \varepsilon/2) c(n, F) > (1 - \varepsilon) c(n, F)
\]
which contradicts the definition of $B_2$. Suppose that for at least $(\varepsilon/4) c(n, F)$ of these
potential copies of $F$, the pair from $M$ misses $e$. The number of times each such pair from
$M$ is counted is at most the number of ways to choose the remaining $f - 4$ vertices of the
potential copy of $F$ ($e$ is fixed), and then $|F| < f^2$ pairs among the chosen vertices so that
the resulting vertices and pairs form a graph isomorphic to $F$. There are at most $2^{f^2} n^{f-4}$
ways to do this. We obtain the contradiction
\[
\frac{\varepsilon \alpha_F}{8 \cdot 2^{f^2}} n^2 = \frac{(\varepsilon/4)(\alpha_F/2) n^{f-2}}{2^{f^2} n^{f-4}} < \frac{(\varepsilon/4) c(n, F)}{2^{f^2} n^{f-4}} \le |M| \le \delta_2 n^2,
\]
where the strict inequality follows from Lemma 5. We may therefore assume that for
at least $(\varepsilon/4) c(n, F)$ of these potential copies of $F$, the pair from $M$ intersects $e$. Each such
pair is counted at most $2^{f^2} n^{f-3}$ times, so we conclude that there exists $x \in e$ with
\[
d_M(x) \ge \frac{(\varepsilon/8) c(n, F)}{2^{f^2} n^{f-3}} > \frac{(\varepsilon/8)(\alpha_F/2) n^{f-2}}{2^{f^2} n^{f-3}} = \frac{\varepsilon \alpha_F}{16 \cdot 2^{f^2}} n > \varepsilon_1 n.
\]

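Let
\[
A = \{v \in V(H) : d_M(v) \ge \varepsilon_1 n\}.
\]
We have argued above that every $e \in B_2$ has a vertex in $A$. Consequently,
\[
2 \sum_{v \in A} d_{B_2}(v) \ge 2|B_2| \ge 2\varepsilon |B| \ge 2\varepsilon |M| \ge \varepsilon \sum_{v \in A} d_M(v) \ge \varepsilon |A| \varepsilon_1 n,
\]
and there exists a vertex $u \in A$ such that $d_{B_2}(u) \ge (\varepsilon \varepsilon_1/2) n > \varepsilon_2 n$. Assume wlog that
$u \in V_1$.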

By the choice of the partition, we may assume that $u$ has at least $\varepsilon_2 n$ neighbors in $V_i$ for each
$i > 1$, otherwise moving $u$ to $V_i$ increases $e(V_1, \ldots, V_r)$. For each $i = 1, \ldots, r$, let $V'_i$ be a set
of $\lceil \varepsilon_2 n \rceil$ neighbors of $u$ in $V_i$ and for $i = 1$, enlarge $V'_1$ by letting $u \in V'_1$. Let $n'_i = |V'_i|$. Pick
$v \in V'_1 - \{u\}$. Then the number of copies of $F$ in the complete $r$-partite graph $K(V'_1, \ldots, V'_r)$
together with edge $e = uv$ is by definition at least
\[
c(r \lceil \varepsilon_2 n \rceil, F) \ge \alpha_F (r \varepsilon_2 n)^{f-2} - \beta_F (r \lceil \varepsilon_2 n \rceil)^{f-3} > \varepsilon_3 n^{f-2}
\]
for suitable $\varepsilon_3$ depending only on $F$. Let us sum this inequality over all such $e = uv \in B_2$
with $v \in V'_1$. Since $\delta = \varepsilon_4/4\alpha_F$ and $c(n, F) < 2\alpha_F n^{f-2}$, we obtain at least
\[
(|V'_1| - 1) \varepsilon_3 n^{f-2} \ge (\varepsilon_2 n) \varepsilon_3 n^{f-2} > \varepsilon_4 n^{f-1} = 4\delta \alpha_F n^{f-1} > 4 q \alpha_F n^{f-2} > 2 q c(n, F)
\]
potential copies of $F$ containing $u$. At least half of these potential copies of $F$ must have a
pair from $M$, otherwise we are done. This pair from $M$ cannot be incident with $u$, since $u$ is
adjacent to all vertices in $K(V'_1, \ldots, V'_r)$ (other than itself). Hence this pair from $M$ misses
$u$. Each such pair is counted at most $2^{f^2} n^{f-3}$ times, so we obtain the contradiction
\[
\frac{\varepsilon_4}{2^{f^2+1}} n^2 = \frac{\varepsilon_4 n^{f-1}}{2^{f^2+1} n^{f-3}} \le |M| \le \delta_2 n^2.
\]

This concludes the proof of the Claim. □ 

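If $s \ge 4\varepsilon q$, then counting copies of $F$ from edges of $B_1$ and using Claim 1 we get
\begin{align*}
\#F \ge \sum_{e \in B_1} F(e) &\ge \sum_{e \in B_1} (1 - \varepsilon) c(n, F) \\
&= |B_1| (1 - \varepsilon) c(n, F) \\
&\ge (1 - \varepsilon)^2 |B| c(n, F) \\
&> (1 - 2\varepsilon)(q + s) c(n, F) \\
&\ge (q + 2\varepsilon q - 8\varepsilon^2 q) c(n, F) \\
&> q c(n, F).
\end{align*}
So we may assume that $s < 4\varepsilon q < q < n/3r$. Recall that $n_i = |V_i|$. Then
\[
t_r(n) - s = |G| \le \sum_{1 \le i < j \le r} n_i n_j.
\]
Now Lemma 4 implies that for each $i$,
\[
\lfloor n/r \rfloor - s \le n_i \le \lceil n/r \rceil + s.
\]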
Observe that $|M| \le s$ for otherwise $|G \cup M| > t_r(n)$ which is impossible. Pick $e \in B$ and
assume wlog that $e \subset V_1$. The number of potential copies of $F$ containing $e$ is by definition
$c(n_1, \ldots, n_r, F)$. Now Lemma 6 implies that
\[
c(n_1, \ldots, n_r, F) \ge c(n, F) - \gamma_F s n^{f-3}.
\]
Not all of these potential copies of $F$ are in $H$; in fact, a pair from $M$ lies in at most $2^{f^2} n^{f-3}$
potential copies counted above (we may assume that the pair intersects $e$, otherwise it is
counted at most $2^{f^2} n^{f-4}$ times). We conclude that
\[
F(e) \ge c(n_1, \ldots, n_r, F) - 2^{f^2} n^{f-3} |M| \ge c(n, F) - \gamma_F s n^{f-3} - 2^{f^2} s n^{f-3}.
\]
Since $s < q$ this implies that
\begin{align*}
\#F \ge \sum_{e \in B} F(e) &\ge (q + s)\left(c(n, F) - \gamma_F s n^{f-3} - 2^{f^2} s n^{f-3}\right) \\
&\ge q c(n, F) + s c(n, F) - 2q\left(\gamma_F s n^{f-3} + 2^{f^2} s n^{f-3}\right).
\end{align*}
As $s < q < \delta n < \varepsilon n$ and $\varepsilon < \alpha_F/(4(\gamma_F + 2^{f^2}))$, we have the bound
\[
2q(\gamma_F + 2^{f^2}) s n^{f-3} < 2\varepsilon(\gamma_F + 2^{f^2}) s n^{f-2} < s (\alpha_F/2) n^{f-2} < s c(n, F).
\]
This shows that $\#F > q c(n, F)$ and completes the proof of the theorem. □